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SYSTEM AND METHOD FOR EFFICIENTLY FINDING NEAR-SIMILAR 
IMAGES IN MASSIVE DATABASES 


ABSTRACT OF THE DISCLOSURE 


Massive amounts of multimedia data are stored in databases supporting web 
pages and servers, including text, graphics, video and audio. Searching and finding 
matching multimedia images can be time and computationally intensive. A method for 
storing and retrieving image data includes computing a descriptor, such an a Fourier- 
Mellin Transform (FMT), corresponding to a multidimensional space indicative of each 
of the stored images and organizing each of the descriptors according to a set similarity 
metric. The set similarity metric is based on Locality-Sensitive Hashing (LSH), and 
orders descriptors near to other descriptors in the database. The set similarity metric 
employs set theory which allows distance between descriptors to be computed 
consistent with LSH. A target image for which a match is sought is then received, and a 
descriptor indicative of the target image is computed. The database is referenced, or 
mapped, to determine close matches in the database. Mapping includes selecting a 
candidate match descriptor from among the descriptors in the database and employing a 
distance metric derived from the similarity metric to determine if the candidate match 
descriptor is a match to the target descriptor. 


